Goto

Collaborating Authors

 Patra


A Universal Load Balancing Principle and Its Application to Large Language Model Serving

Chen, Zixi, Bu, Tianci, Song, Chendong, Lu, Xin, Ye, Yinyu, Zhou, Zijie

arXiv.org Machine Learning

Load balancing-the allocation of work across parallel resources to reduce delay, energy and cost-is a pervasive challenge in science and engineering, from large-scale simulation and data processing to cloud and manufacturing operations. Motivated by the emerging bottleneck in large language model (LLM) serving, we study a particularly stringent regime of load balancing that arises in barrier-synchronized, stateful systems: work cannot be freely migrated and progress is gated by the slowest participant at each step, so heterogeneity and temporal drift in workloads create persistent stragglers and substantial idle time. LLM serving under data-parallel decoding provides a prominent modern instance: in production traces, barrier-induced idle can exceed 40% of compute time per decode step. Here we develop a universal load-balancing principle, which admits a step-wise finite-horizon integer-optimization formulation and yields worst-case guarantees: across LLM decode models and a broader class of non-decreasing workload drift processes, it reduces long-run imbalance by a factor that grows with batch size and system scale. Extensive experiments corroborate the theory, showing substantial improvements in throughput and latency together with reductions in energy consumption. These results provide a general, theoretically grounded framework for load balancing, with immediate implications for sustainable LLM serving and broad relevance to other synchronization-gated resource-allocation problems.


Field-programmable dynamics in a soft magnetic actuator enabling true random number generation and reservoir computing

Oliveros-Mata, Eduardo Sergio, Pylypovskyi, Oleksandr V., Raimondo, Eleonora, Illing, Rico, Zabila, Yevhen, Guo, Lin, Mu, Guannan, López, Mónica Navarro, Wang, Xu, Tzortzinis, Georgios, Filippatos, Angelos, Bermúdez, Gilbert Santiago Cañón, Garescì, Francesca, Finocchio, Giovanni, Makarov, Denys

arXiv.org Artificial Intelligence

Department of Mathematical and Computer Sciences, Physical Sciences and Earth Sciences, University of Messina, 8166 Messina, Italy Complex and even chaotic dynamics, though prevalent in many natural and engineered systems, has been largely avoided in the design of electromechanical systems due to concerns about wear and controlability. Here, we demonstrate that complex dynamics might be particularly advantageous in soft robotics, offering new functionalities beyond motion not easily achievable with traditional actuation methods. We designed and realized resilient magnetic soft actuators capable of operating in a tunable dynamic regime for tens of thousands cycles without fatigue. We experimentally demonstrated the application of these actuators for true random number generation and stochastic computing. These findings show that exploring the complex dynamics in soft robotics would extend the application scenarios in soft computing, human-robot interaction and collaborative robots as we demonstrate with biomimetic blinking and randomized voice modulation. A large number of mechanical systems, including simple ones such as the double pendulum, exhibit dynamics characterized by deterministic periodic and chaotic responses depending on the excitation frequency f and amplitude A of the applied force [1]. Mechanical systems with a tendency to chaotisation demonstrate multiple resonances and various transitions to chaos [2]. Today, the concept of complexity and, especially, deterministic chaos that refers to systems without stochastic fluctuations jet losing stability of phase space trajectories is explored for a variety of directions [3] even including biological systems [4] or optics [5]. In particular, chaos is a fundamental aspect of electromechanical systems and is broadly explored in motion planning for mobile rigid robots, fluid mixing, and improving energy harvesting, as well as in mechanisms used in washing machines, dishwashers, and air conditioners [6]. Although the analysis of traditional robotics and mechanisms has revealed inherent chaotic dynamics [7], chaos can also be intentionally generated through nonlinear feedback [6] to achieve specific functionalities. In contrast to rigid mechanisms, soft actuators can facilitate transition into complex dynamics without the need for dedicated feedback algorithms. Mechanically soft actuators do not possess any rigid components in their embodiment rendering them ideally suited to explore complex and even chaotic dynamics which is typically observed at higher frequencies (Supplementary Tables 1 and 2). The inherent nonlinear oscillations emerging in soft actuators for specific parameter values [8, 9] can be applied for secure, biomimetic, and soft computing applications.


Vector Quantized-Elites: Unsupervised and Problem-Agnostic Quality-Diversity Optimization

Tsakonas, Constantinos, Chatzilygeroudis, Konstantinos

arXiv.org Artificial Intelligence

Quality-Diversity algorithms have transformed optimization by prioritizing the discovery of diverse, high-performing solutions over a single optimal result. However, traditional Quality-Diversity methods, such as MAP-Elites, rely heavily on predefined behavior descriptors and complete prior knowledge of the task to define the behavior space grid, limiting their flexibility and applicability. In this work, we introduce Vector Quantized-Elites (VQ-Elites), a novel Quality-Diversity algorithm that autonomously constructs a structured behavior space grid using unsupervised learning, eliminating the need for prior task-specific knowledge. At the core of VQ-Elites is the integration of Vector Quantized Variational Autoencoders, which enables the dynamic learning of behavior descriptors and the generation of a structured, rather than unstructured, behavior space grid -- a significant advancement over existing unsupervised Quality-Diversity approaches. This design establishes VQ-Elites as a flexible, robust, and task-agnostic optimization framework. To further enhance the performance of unsupervised Quality-Diversity algorithms, we introduce behavior space bounding and cooperation mechanisms, which significantly improve convergence and performance, as well as the Effective Diversity Ratio and Coverage Diversity Score, two novel metrics that quantify the actual diversity in the unsupervised setting. We validate VQ-Elites on robotic arm pose-reaching, mobile robot space-covering, and MiniGrid exploration tasks. The results demonstrate its ability to efficiently generate diverse, high-quality solutions, emphasizing its adaptability, scalability, robustness to hyperparameters, and potential to extend Quality-Diversity optimization to complex, previously inaccessible domains.





Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces Vincent Cohen-Addad Google Research, Zurich. David Saulpic Sorbonne Universit e, Paris Chris Schwiegelshohn

Neural Information Processing Systems

Special cases of problem include the well-known Fermat-Weber problem - or geometric median problem - where z = 1, the mean or centroid where z = 2, and the Minimum Enclosing Ball problem, where z = . We consider these problem in the big data regime. Here, we are interested in sampling as few points as possible such that we can accurately estimate m. More specifically, we consider sublinear algorithms as well as coresets for these problems. Sublinear algorithms have a random query access to the set A and the goal is to minimize the number of queries.